Searching for Common Sense: Populating Cyc™ from the Web

Authors

  • Cynthia Matuszek
  • Michael J. Witbrock
  • Robert C. Kahlert
  • John Cabral
  • David Schneider
  • Purvesh Shah
  • Douglas B. Lenat

Abstract

The Cyc project is predicated on the idea that effective machine learning depends on having a core of knowledge that provides a context for novel learned information – what is known informally as “common sense.” Over the last twenty years, a sufficient core of common sense knowledge has been entered into Cyc to allow it to begin effectively and flexibly supporting its most important task: increasing its own store of world knowledge. In this paper, we present initial work on a method of using a combination of Cyc and the World Wide Web, accessed via Google, to assist in entering knowledge into Cyc. The long-term goal is automating the process of building a consistent, formalized representation of the world in the Cyc knowledge base via machine learning. We present preliminary results of this work and describe how we expect the knowledge acquisition process to become more accurate, faster, and more automated in the future.

Similar resources

Automated Population of Cyc: Extracting Information about Named-entities from the Web

Populating the Cyc Knowledge Base (KB) has been a manual process until very recently. However, there is currently enough knowledge in Cyc for it to be feasible to attempt to acquire additional knowledge autonomously. This paper describes a system that can collect and validate formally represented, fully-integrated knowledge from the Web or any other electronically available text corpus, about v...

Integrating Cyc and Wikipedia: Folksonomy Meets Rigorously Defined Common-Sense

Integration of ontologies begins with establishing mappings between their concept entries. We map categories from the largest manually-built ontology, Cyc, onto Wikipedia articles describing corresponding concepts. Our method draws both on Wikipedia’s rich but chaotic hyperlink structure and Cyc’s carefully defined taxonomic and common-sense knowledge. On 9,333 manual alignments by one person, ...

Cyc: toward Programs with Common Sense

Cyc, a massive project to create a knowledge base spanning all human consensus knowledge, is discussed. The project will require the development of a new logic language for expressing knowledge before sets of procedures can be created and the knowledge base itself built. Cyc programmers developed CycL, a unique representation language and inference engine. Inferencing in Cyc at the heuristic le...

A Semantic Portal for Fund Finding in the EU: Semantic Upgrade, Integration and Publication of Heterogeneous Legacy Data

FundFinder is a Semantic Web portal that allows searching for and navigating through information about funding opportunities. This application has been created following a set of techniques and using a set of tools for the upgrade of legacy content to the Semantic Web, including databases and semistructured documents. This process consists in extracting and populating knowledge from heterogeneo...

Knowledge Begets Knowledge: Steps towards Assisted Knowledge Acquisition in Cyc

The Cyc project is predicated on the idea that, in order to be effective and flexible, computer software must have an understanding of the context in which its tasks are performed. We believe this context is what is known informally as “common sense.” Over the last twenty years, sufficient common sense knowledge has been entered into Cyc to allow it to more effectively and flexibly support an i...

Publication date: 2005